Intelligently Aiding Human-Guided Correction of Speech Recognition

نویسندگان

  • Keith Vertanen
  • Per Ola Kristensson
چکیده

Correcting recognition errors is often necessary in a speech interface. The process of correcting errors can not only reduce users’ performance, but can also lead to frustration. While making fewer recognition errors is undoubtedly helpful, facilities for supporting userguided correction are also critical. We explore how to better support user corrections using Parakeet – a continuous speech recognition system for text entry. Parakeet’s interface is designed for easy error correction on a mobile touch-screen device. Users correct errors by selecting alternative words from a word confusion network and by typing on a predictive software keyboard. Our interface design was guided by computational experiments and used a variety of information sources to aid the correction process. In user studies, participants were able to write text efficiently despite sometimes high initial recognition error rates. Using Parakeet as an example, we discuss principles we found were important for building an effective speech correction interface.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mabel: Building a Robot Designed for Human Interaction

Mabel (the Mobile Table) is a robotic system that can perform waypoint and vision guided navigation, speech generation, speech recognition, person finding, face finding, and face following. Mabel can interact intelligently with humans in two different settings: food and information serving. The robot’s architecture is flexible and easily adaptable to other tasks such as search and rescue.

متن کامل

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Automatic Speech Recognition for Car Kits Using a Microphone Array

In this paper we present a novel solutions for microphone array based car kit systems that intelligently uses the multipath environment to enhance signal coming from a desired location. Our solution requires a low computational load, and can be deployed on most of the platforms. We present speech recognition rates on real data, and compare a stereo versus a mono solution on this database.

متن کامل

Automatic alignment and error correction of human generated transcripts for long speech recordings

In this paper we examine the issues of aligning and correcting approximate human generated transcripts for long audio files. Accurate time-aligned transcriptions help provide easier access to audio materials by aiding downstream applications such as the indexing, summarizing and retrieving of audio segments. Accurate time alignments are also necessary when incorporating audio data into the trai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010